Track down faults in your memory hoards , 




m 



Here is a memory diagnostic routine for your 2650 Mini Computer. 
It will exercise each and every bit in a specified memory range with 
four distinct tests, and produce a printout of any faulty locations. It 
can also be used to track down intermittent faults. 


by DAVID EDWARDS 


Memory testing can be a very tedious 
and time consuming process, so most 
operators of small systems simply 
assume that all is OK, and get on with 
writing programs. But when a program 
you have triple checked and are sure is 
OK fails to operate correctly, you start 
to wonder about your memory. 

Ninetynine times out of a hundred, 
of course, the memory is working cor¬ 
rectly, and the bug is in your program 
(moral: check, check and check again, 
and if you can get a second opinion, do 
so!). But what do you do if the program 
still refuses to operate correctly? 

Well, you can always employ the old 
standby, the walking finger test. This in¬ 
volves placing your index finger in turn 
on all of the memory chips. The chip 
(or chips) that sends you running to the 
first aid cabinet is then faulty. Don't 
laugh, this does work, and I have used 
it in the past. 

t Mut this test will not show up faults 
':ke open circuit address or data lines, 
or short circuits between adjacent PCB 
tracks. This type of fault is quite com¬ 
mon on large memory boards, as they 


have more and more memory cramm¬ 
ed onto them. 

In these situations, what is required is 
some sort of software test routine 
which will exercise all memory 
locations of interest, and provide clues 
as to where the fault is. This is the func¬ 
tion of the program described in this 
article. 

The tests described here are based 
on those presented by Charles E. Cook, 
in the October 1977 issue of the US 
magazine, "Kilobaud”. Two of the tests 
are quite simple, and check that each 
location can store and read back both a 
null (X'OO and a delete (X'FF). 

The third test is known as the "walk¬ 
ing bit test", and is perhaps the most 
important test. It verifies the 
"changeability" of each bit of the test 
location, by storing first the pattern 
00000001, then 00000010, and so on up 
to 10000000, each time checking that 
only the correct pattern can be read 
back from the memory. The test bit (the 
1) has been "walked" through the test 
byte. 

The fourth test is really a combina- 


0440 
0 450 
0 46 0 
0 47 0 
0 48 0 
0 49 0 
04A0 
04B0 
0 4C0 
04 D0 
0 4E0 
04F0 


09 IE 
0D 16 
00 00 
04 4C 
3B FI 
40 7 7 
D9 00 
3F 04 
60 BC 
06 80 
CC 8 4 
IE 04 


0A ID 
EA 0B 
04 5 A 
3F 02 
FB 05 
02 75 
CD 04 
55 20 
04 6 2 
CE 8 4 
60 EC 
E0 FA 


DA 02 

17 09 
IB 0C 
B4 09 
3F 00 

18 3F 
5E CE 
CC 84 
3B F2 
6 0 EE 
84 60 
00 CA 


D9 00 
05 0A 
04 53 
69 3F 
8A 07 
02 DB 
04 5F 
60 3F 
1A 76. 
84 6 0 
BC 04 
F7 5 A 


C9 16 
04 3B 
IB 08 
02 69 
05 04 
C9 4 2 
3B EC 
04 4D 
3B E7 
BC 04 
70 3B 
F8 9 B 


CA 15 17 
6D 17 00 
06 40 04 
09 65 3B 
01 C8 01 
CA 41 3B 
CA 62 33 
1A 77 3B 
0C 84 60 
6A D2 9 A 
CD 1A 5D 
22 


This is a hexadecimal listing of the 
disassembler program to produce 


3B 71 E9 
00 00 00 
57 IB 02 
FA 04 20 
17 00 76 
F8 DA 02 
D7 07 05 
F3 0C 84 
BC 04 66 
74 04 FF 
0E 04 8 E 
FIG. 2 


2650 memory test program. You can use the 
a mnemonic listing of it. 


tion of the three earlier tests. The 
whole of the test area of memory is first 
cleared, and then tested for correct 
clearing (this is the first test). Next, the 
walking bit test and the delete test are 
performed on the first test location. 
Then before these two tests are carried 
out on the second location, it is tested 
to see if it is still zero. If it is not, then 
there is obviously a memory fault of 
some type or other. 

This process is repeated in turn 
throughout the test memory area, and 
forms the fourth test. 

In order for the operator to be able 
to use these test results, it is necessary 
to know not only the type of faults en¬ 
countered, but also their locations. To 
simplify matters, we have called the first 
test the Z test, the second the L test, the 
third the W test, and the fourth the S 
test. Then all the program has to do is 
print out the code letter of the test, 
followed by the appropriate address. 

A flowchart for the basic test routine 
is shown in Fig. 1. Test S is carried out at 
the start of the main loop. The failure 
sections incorporate the error message 
printing routines, and produce a listing 
five entries wide, which can be accom¬ 
modated on a 32 character-per-line 
VDU. 

If the test routine is run once, it will 
catch and record all permanent faults, 
but is unlikely to give any indications of 
intermittent faults. To catch this type of 
fault, we must repeat the basic test 
routine a large number of times. 

It would be wise, of course, to 
arrange that once a fault has been 
detected, that the program stops at the 
end of the current basic test. If this is 
not done, then,there is a fair chance 
that you will be rewarded with a great 
screed of endlessly repeated error 
message sets, whereas only one set is 
required. 

The complete program, incor- 
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orating all of these points, is given as a 
exadecimal listing in Fig. 2. It occupies 
locations X'440 to X'4FA inclusive, and 
is not easily relocated. It uses PIPBUG 
routines COUT, BOUT, CRLF and 
GNUM. 

To be able to use this program, the 
memory area it occupies must be work¬ 
ing correctly, and so must the 
processor. If you are not sure about 
this, try it anyway; if it works, then all is 
OK. Otherwise, you will have to do 
some fault-finding and corrections first. 

To call the program, type G48F XXXX 
YYYY ZZ cr, where X is tne start address 
of the memory range to be tested, and 
Y is the end address. Remember that 
the existing contents of the test area 
will be destroyed, and that you cannot 
test the area of memory occupied by 
the test program. 

The parameter Z determined how 
many basic tests are to be carried out. 
X'01 gives one test, X'02 gives two, and 
so on up to X'7F, which produces 127 
tests. All negative numbers such as X'80 
and X'FF, produce an unlimited 
number of tests, terminated only when 
an error is detected. 

The first time you use the program, 
specify only one test. Any errors you 
get will almost certainly be permanent 
faults, and should be found and cor¬ 
rected first. Only when this has been 
done should you attempt to trace inter- 
mittents using multiple tests. 

In these initial tests, it may be advan¬ 
tageous to test only small amounts of 
memory at a time, say IK blocks. This 
will allow you to isolate any faults more 
rapidly. 

At this stage, you are probably 
wondering what all the rather strange 
lists of error locations mean, and how 
they can be used to locate faults in your 
memory. Well, simple faults should 
show up as easy to understand patterns. 

For instance, if a data line to a par¬ 
ticular chip is open, then all locations in 
this chip should fail the W and L tests. 
Similarly, if an address line to a 
particular chip is open, then we would 
expect test S to fail at all locations 
where this address line would normally 
go high. This is because the open line 
will normally float high, so that when 
we address lower bytes, we will actually 
write into higher locations, and will get 
an S message when we do address these 
bytes. 

Further information on the types of 
faults which can occur in memory, and 
the results they produce with our test 
program, can be obtained from Cook's 
article. In any case, you will have to play 
at being a detective, and apply a little 
deductive reasoning. 

Finally, a few detailed comments on 
the program for those who may wish to 
modify it. The start, end and current ad¬ 



INITIAUSE 
TEST ADDRESS 


dresses are stored in locations X'45C to 
X'461, while the number of tests is 
stored in location X'48E. The number of 
error messages on each line is specified 
in locations X'488 and X'4AF. 

To remove the auto-stop facility 
when errors occur, change locations 
X'48B and X'48C to the NOP code, X'CO. 
If you wish to obtain an error message 
every time the walking bit test fails, 
rather than just once for each walking 
bit test, change locations X'46A and 
X'46B to NOPs. 



By changing locations X'4ED to X'4F2 
inclusive to NOPs, you can delete the 
repeat forever facility, and obtain a 
maximum of 256 basic tests (specify X'OO 
in the calling line). 

In conclusion, I wish you happy fault 
hunting, and successful debugging of 
your own programs. Because once you 
nave assured yourself that your 
memory is OK, then you realise that the 
reason your program won't run cor¬ 
rectly is because you have written a bug 
into it! & 
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FIG. 1 BASIC TEST ROUTINE 


Readers with systems based on CPUs other than the 2650 can use this flowchart to 
write their own diagnostic routines. 
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